Dataset statistics
| Number of variables | 14 |
|---|---|
| Number of observations | 3333 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 364.7 KiB |
| Average record size in memory | 112.0 B |
Variable types
| Numeric | 12 |
|---|---|
| Categorical | 2 |
day_charge is highly correlated with total_charge | High correlation |
total_charge is highly correlated with day_charge and 1 other fields | High correlation |
churn is highly correlated with total_charge | High correlation |
voice_mail_messages has 2411 (72.3%) zeros | Zeros |
customer_service_calls has 697 (20.9%) zeros | Zeros |
Reproduction
| Analysis started | 2022-11-05 10:57:13.127100 |
|---|---|
| Analysis finished | 2022-11-05 10:57:54.337702 |
| Duration | 41.21 seconds |
| Software version | pandas-profiling v3.4.0 |
| Download configuration | config.json |
account_length
Real number (ℝ≥0)
| Distinct | 212 |
|---|---|
| Distinct (%) | 6.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 101.0648065 |
| Minimum | 1 |
|---|---|
| Maximum | 243 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 35 |
| Q1 | 74 |
| median | 101 |
| Q3 | 127 |
| 95-th percentile | 167 |
| Maximum | 243 |
| Range | 242 |
| Interquartile range (IQR) | 53 |
Descriptive statistics
| Standard deviation | 39.82210593 |
|---|---|
| Coefficient of variation (CV) | 0.3940254508 |
| Kurtosis | -0.1078359806 |
| Mean | 101.0648065 |
| Median Absolute Deviation (MAD) | 27 |
| Skewness | 0.09660629423 |
| Sum | 336849 |
| Variance | 1585.800121 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 105 | 43 | 1.3% |
| 87 | 42 | 1.3% |
| 101 | 40 | 1.2% |
| 93 | 40 | 1.2% |
| 90 | 39 | 1.2% |
| 95 | 38 | 1.1% |
| 86 | 38 | 1.1% |
| 100 | 37 | 1.1% |
| 116 | 37 | 1.1% |
| 112 | 36 | 1.1% |
| Other values (202) | 2943 |
| Value | Count | Frequency (%) |
| 1 | 8 | |
| 2 | 1 | < 0.1% |
| 3 | 5 | |
| 4 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 6 | 2 | 0.1% |
| 7 | 2 | 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 3 | 0.1% |
| 10 | 3 | 0.1% |
| Value | Count | Frequency (%) |
| 243 | 1 | < 0.1% |
| 232 | 1 | < 0.1% |
| 225 | 2 | |
| 224 | 2 | |
| 221 | 1 | < 0.1% |
| 217 | 2 | |
| 215 | 1 | < 0.1% |
| 212 | 2 | |
| 210 | 2 | |
| 209 | 3 |
| Distinct | 46 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.099009901 |
| Minimum | 0 |
|---|---|
| Maximum | 51 |
| Zeros | 2411 |
| Zeros (%) | 72.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 20 |
| 95-th percentile | 36 |
| Maximum | 51 |
| Range | 51 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 13.68836537 |
|---|---|
| Coefficient of variation (CV) | 1.690128243 |
| Kurtosis | -0.05112853879 |
| Mean | 8.099009901 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.264823634 |
| Sum | 26994 |
| Variance | 187.3713466 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2411 | |
| 31 | 60 | 1.8% |
| 29 | 53 | 1.6% |
| 28 | 51 | 1.5% |
| 33 | 46 | 1.4% |
| 27 | 44 | 1.3% |
| 30 | 44 | 1.3% |
| 24 | 42 | 1.3% |
| 26 | 41 | 1.2% |
| 32 | 41 | 1.2% |
| Other values (36) | 500 | 15.0% |
| Value | Count | Frequency (%) |
| 0 | 2411 | |
| 4 | 1 | < 0.1% |
| 8 | 2 | 0.1% |
| 9 | 2 | 0.1% |
| 10 | 1 | < 0.1% |
| 11 | 2 | 0.1% |
| 12 | 6 | 0.2% |
| 13 | 4 | 0.1% |
| 14 | 7 | 0.2% |
| 15 | 9 | 0.3% |
| Value | Count | Frequency (%) |
| 51 | 1 | < 0.1% |
| 50 | 2 | 0.1% |
| 49 | 1 | < 0.1% |
| 48 | 2 | 0.1% |
| 47 | 3 | 0.1% |
| 46 | 4 | 0.1% |
| 45 | 6 | 0.2% |
| 44 | 7 | |
| 43 | 9 | |
| 42 | 15 |
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.562856286 |
| Minimum | 0 |
|---|---|
| Maximum | 9 |
| Zeros | 697 |
| Zeros (%) | 20.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 4 |
| Maximum | 9 |
| Range | 9 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.315491045 |
|---|---|
| Coefficient of variation (CV) | 0.8417223368 |
| Kurtosis | 1.730913655 |
| Mean | 1.562856286 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.091359482 |
| Sum | 5209 |
| Variance | 1.730516689 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 1181 | |
| 2 | 759 | |
| 0 | 697 | |
| 3 | 429 | 12.9% |
| 4 | 166 | 5.0% |
| 5 | 66 | 2.0% |
| 6 | 22 | 0.7% |
| 7 | 9 | 0.3% |
| 9 | 2 | 0.1% |
| 8 | 2 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 697 | |
| 1 | 1181 | |
| 2 | 759 | |
| 3 | 429 | 12.9% |
| 4 | 166 | 5.0% |
| 5 | 66 | 2.0% |
| 6 | 22 | 0.7% |
| 7 | 9 | 0.3% |
| 8 | 2 | 0.1% |
| 9 | 2 | 0.1% |
| Value | Count | Frequency (%) |
| 9 | 2 | 0.1% |
| 8 | 2 | 0.1% |
| 7 | 9 | 0.3% |
| 6 | 22 | 0.7% |
| 5 | 66 | 2.0% |
| 4 | 166 | 5.0% |
| 3 | 429 | 12.9% |
| 2 | 759 | |
| 1 | 1181 | |
| 0 | 697 |
international_plan
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 26.2 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3333 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3010 | |
| 1 | 323 | 9.7% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 3010 | |
| 1 | 323 | 9.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3010 | |
| 1 | 323 | 9.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3333 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3010 | |
| 1 | 323 | 9.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3333 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3010 | |
| 1 | 323 | 9.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3333 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3010 | |
| 1 | 323 | 9.7% |
day_calls
Real number (ℝ≥0)
| Distinct | 119 |
|---|---|
| Distinct (%) | 3.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 100.4356436 |
| Minimum | 0 |
|---|---|
| Maximum | 165 |
| Zeros | 2 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 67 |
| Q1 | 87 |
| median | 101 |
| Q3 | 114 |
| 95-th percentile | 133 |
| Maximum | 165 |
| Range | 165 |
| Interquartile range (IQR) | 27 |
Descriptive statistics
| Standard deviation | 20.06908421 |
|---|---|
| Coefficient of variation (CV) | 0.1998203376 |
| Kurtosis | 0.2431815246 |
| Mean | 100.4356436 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | -0.111786639 |
| Sum | 334752 |
| Variance | 402.7681409 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 102 | 78 | 2.3% |
| 105 | 75 | 2.3% |
| 95 | 69 | 2.1% |
| 107 | 69 | 2.1% |
| 104 | 68 | 2.0% |
| 108 | 67 | 2.0% |
| 97 | 67 | 2.0% |
| 106 | 66 | 2.0% |
| 112 | 66 | 2.0% |
| 110 | 66 | 2.0% |
| Other values (109) | 2642 |
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 30 | 1 | < 0.1% |
| 35 | 1 | < 0.1% |
| 36 | 1 | < 0.1% |
| 40 | 2 | |
| 42 | 2 | |
| 44 | 3 | |
| 45 | 3 | |
| 47 | 2 | |
| 48 | 3 |
| Value | Count | Frequency (%) |
| 165 | 1 | < 0.1% |
| 163 | 1 | < 0.1% |
| 160 | 1 | < 0.1% |
| 158 | 3 | |
| 157 | 1 | < 0.1% |
| 156 | 1 | < 0.1% |
| 152 | 1 | < 0.1% |
| 151 | 5 | |
| 150 | 6 | |
| 149 | 1 | < 0.1% |
| Distinct | 1667 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 30.56230723 |
| Minimum | 0 |
|---|---|
| Maximum | 59.64 |
| Zeros | 2 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 15.288 |
| Q1 | 24.43 |
| median | 30.5 |
| Q3 | 36.79 |
| 95-th percentile | 46.028 |
| Maximum | 59.64 |
| Range | 59.64 |
| Interquartile range (IQR) | 12.36 |
Descriptive statistics
| Standard deviation | 9.259434554 |
|---|---|
| Coefficient of variation (CV) | 0.3029690947 |
| Kurtosis | -0.01981178724 |
| Mean | 30.56230723 |
| Median Absolute Deviation (MAD) | 6.17 |
| Skewness | -0.02908326834 |
| Sum | 101864.17 |
| Variance | 85.73712826 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 26.18 | 8 | 0.2% |
| 27.12 | 8 | 0.2% |
| 29.67 | 8 | 0.2% |
| 31.18 | 7 | 0.2% |
| 29.82 | 7 | 0.2% |
| 27.59 | 7 | 0.2% |
| 30.38 | 6 | 0.2% |
| 33.12 | 6 | 0.2% |
| 32.18 | 6 | 0.2% |
| 24.87 | 6 | 0.2% |
| Other values (1657) | 3264 |
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 0.44 | 1 | |
| 1.33 | 1 | |
| 1.34 | 1 | |
| 2.13 | 1 | |
| 2.99 | 1 | |
| 3.21 | 1 | |
| 3.32 | 1 | |
| 4.4 | 1 | |
| 4.59 | 1 |
| Value | Count | Frequency (%) |
| 59.64 | 1 | |
| 58.96 | 1 | |
| 58.7 | 1 | |
| 57.36 | 1 | |
| 57.04 | 1 | |
| 56.83 | 1 | |
| 56.59 | 1 | |
| 56.07 | 1 | |
| 55.78 | 1 | |
| 55.51 | 1 |
evening_calls
Real number (ℝ≥0)
| Distinct | 123 |
|---|---|
| Distinct (%) | 3.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 100.1143114 |
| Minimum | 0 |
|---|---|
| Maximum | 170 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 67 |
| Q1 | 87 |
| median | 100 |
| Q3 | 114 |
| 95-th percentile | 133 |
| Maximum | 170 |
| Range | 170 |
| Interquartile range (IQR) | 27 |
Descriptive statistics
| Standard deviation | 19.92262529 |
|---|---|
| Coefficient of variation (CV) | 0.1989987746 |
| Kurtosis | 0.206156468 |
| Mean | 100.1143114 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | -0.05556313904 |
| Sum | 333681 |
| Variance | 396.9109986 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 105 | 80 | 2.4% |
| 94 | 79 | 2.4% |
| 108 | 71 | 2.1% |
| 102 | 70 | 2.1% |
| 97 | 70 | 2.1% |
| 88 | 69 | 2.1% |
| 101 | 68 | 2.0% |
| 109 | 67 | 2.0% |
| 98 | 66 | 2.0% |
| 111 | 65 | 2.0% |
| Other values (113) | 2628 |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 12 | 1 | < 0.1% |
| 36 | 1 | < 0.1% |
| 37 | 1 | < 0.1% |
| 42 | 1 | < 0.1% |
| 43 | 1 | < 0.1% |
| 44 | 1 | < 0.1% |
| 45 | 1 | < 0.1% |
| 46 | 3 | |
| 48 | 6 |
| Value | Count | Frequency (%) |
| 170 | 1 | < 0.1% |
| 168 | 1 | < 0.1% |
| 164 | 1 | < 0.1% |
| 159 | 1 | < 0.1% |
| 157 | 1 | < 0.1% |
| 156 | 1 | < 0.1% |
| 155 | 3 | |
| 154 | 2 | 0.1% |
| 153 | 1 | < 0.1% |
| 152 | 6 |
evening_charge
Real number (ℝ≥0)
| Distinct | 1440 |
|---|---|
| Distinct (%) | 43.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 17.08354035 |
| Minimum | 0 |
|---|---|
| Maximum | 30.91 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 10.1 |
| Q1 | 14.16 |
| median | 17.12 |
| Q3 | 20 |
| 95-th percentile | 24.17 |
| Maximum | 30.91 |
| Range | 30.91 |
| Interquartile range (IQR) | 5.84 |
Descriptive statistics
| Standard deviation | 4.310667643 |
|---|---|
| Coefficient of variation (CV) | 0.2523287067 |
| Kurtosis | 0.02548740481 |
| Mean | 17.08354035 |
| Median Absolute Deviation (MAD) | 2.92 |
| Skewness | -0.02385798901 |
| Sum | 56939.44 |
| Variance | 18.58185553 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 14.25 | 11 | 0.3% |
| 16.12 | 11 | 0.3% |
| 15.9 | 10 | 0.3% |
| 17.09 | 9 | 0.3% |
| 18.62 | 9 | 0.3% |
| 17.99 | 9 | 0.3% |
| 14.44 | 9 | 0.3% |
| 18.96 | 8 | 0.2% |
| 16.35 | 8 | 0.2% |
| 16.97 | 8 | 0.2% |
| Other values (1430) | 3241 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 2.65 | 1 | |
| 3.59 | 1 | |
| 3.61 | 1 | |
| 3.73 | 1 | |
| 4.09 | 1 | |
| 4.18 | 1 | |
| 4.5 | 1 | |
| 4.76 | 1 | |
| 4.98 | 1 |
| Value | Count | Frequency (%) |
| 30.91 | 1 | |
| 30.75 | 1 | |
| 30.11 | 1 | |
| 29.89 | 1 | |
| 29.83 | 1 | |
| 29.79 | 1 | |
| 29.62 | 1 | |
| 29.52 | 1 | |
| 29.01 | 1 | |
| 28.89 | 1 |
night_calls
Real number (ℝ≥0)
| Distinct | 120 |
|---|---|
| Distinct (%) | 3.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 100.1077108 |
| Minimum | 33 |
|---|---|
| Maximum | 175 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 33 |
|---|---|
| 5-th percentile | 68 |
| Q1 | 87 |
| median | 100 |
| Q3 | 113 |
| 95-th percentile | 132 |
| Maximum | 175 |
| Range | 142 |
| Interquartile range (IQR) | 26 |
Descriptive statistics
| Standard deviation | 19.56860935 |
|---|---|
| Coefficient of variation (CV) | 0.1954755452 |
| Kurtosis | -0.07201957894 |
| Mean | 100.1077108 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | 0.03249957015 |
| Sum | 333659 |
| Variance | 382.9304717 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 105 | 84 | 2.5% |
| 104 | 78 | 2.3% |
| 91 | 76 | 2.3% |
| 102 | 72 | 2.2% |
| 100 | 69 | 2.1% |
| 106 | 69 | 2.1% |
| 98 | 67 | 2.0% |
| 94 | 66 | 2.0% |
| 103 | 65 | 2.0% |
| 95 | 64 | 1.9% |
| Other values (110) | 2623 |
| Value | Count | Frequency (%) |
| 33 | 1 | |
| 36 | 1 | |
| 38 | 1 | |
| 42 | 2 | |
| 44 | 1 | |
| 46 | 1 | |
| 48 | 1 | |
| 49 | 2 | |
| 50 | 2 | |
| 51 | 2 |
| Value | Count | Frequency (%) |
| 175 | 1 | < 0.1% |
| 166 | 1 | < 0.1% |
| 164 | 1 | < 0.1% |
| 158 | 1 | < 0.1% |
| 157 | 2 | |
| 156 | 2 | |
| 155 | 2 | |
| 154 | 2 | |
| 153 | 3 | |
| 152 | 3 |
night_charge
Real number (ℝ≥0)
| Distinct | 933 |
|---|---|
| Distinct (%) | 28.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.039324932 |
| Minimum | 1.04 |
|---|---|
| Maximum | 17.77 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 1.04 |
|---|---|
| 5-th percentile | 5.316 |
| Q1 | 7.52 |
| median | 9.05 |
| Q3 | 10.59 |
| 95-th percentile | 12.73 |
| Maximum | 17.77 |
| Range | 16.73 |
| Interquartile range (IQR) | 3.07 |
Descriptive statistics
| Standard deviation | 2.275872838 |
|---|---|
| Coefficient of variation (CV) | 0.2517746463 |
| Kurtosis | 0.08566317984 |
| Mean | 9.039324932 |
| Median Absolute Deviation (MAD) | 1.54 |
| Skewness | 0.008886236769 |
| Sum | 30128.07 |
| Variance | 5.179597173 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9.66 | 15 | 0.5% |
| 9.45 | 15 | 0.5% |
| 8.47 | 14 | 0.4% |
| 8.88 | 14 | 0.4% |
| 7.69 | 13 | 0.4% |
| 8.64 | 12 | 0.4% |
| 10.8 | 11 | 0.3% |
| 10.49 | 11 | 0.3% |
| 10.35 | 11 | 0.3% |
| 8.57 | 11 | 0.3% |
| Other values (923) | 3206 |
| Value | Count | Frequency (%) |
| 1.04 | 1 | |
| 1.97 | 1 | |
| 2.03 | 1 | |
| 2.13 | 1 | |
| 2.25 | 2 | |
| 2.4 | 1 | |
| 2.43 | 1 | |
| 2.45 | 1 | |
| 2.55 | 1 | |
| 2.59 | 1 |
| Value | Count | Frequency (%) |
| 17.77 | 1 | |
| 17.19 | 1 | |
| 16.99 | 1 | |
| 16.55 | 1 | |
| 16.42 | 1 | |
| 16.39 | 1 | |
| 15.97 | 1 | |
| 15.86 | 1 | |
| 15.85 | 1 | |
| 15.76 | 1 |
international_calls
Real number (ℝ≥0)
| Distinct | 21 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.479447945 |
| Minimum | 0 |
|---|---|
| Maximum | 20 |
| Zeros | 18 |
| Zeros (%) | 0.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 4 |
| Q3 | 6 |
| 95-th percentile | 9 |
| Maximum | 20 |
| Range | 20 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.461214271 |
|---|---|
| Coefficient of variation (CV) | 0.5494458917 |
| Kurtosis | 3.083588982 |
| Mean | 4.479447945 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.321478166 |
| Sum | 14930 |
| Variance | 6.057575686 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 668 | |
| 4 | 619 | |
| 2 | 489 | |
| 5 | 472 | |
| 6 | 336 | |
| 7 | 218 | 6.5% |
| 1 | 160 | 4.8% |
| 8 | 116 | 3.5% |
| 9 | 109 | 3.3% |
| 10 | 50 | 1.5% |
| Other values (11) | 96 | 2.9% |
| Value | Count | Frequency (%) |
| 0 | 18 | 0.5% |
| 1 | 160 | 4.8% |
| 2 | 489 | |
| 3 | 668 | |
| 4 | 619 | |
| 5 | 472 | |
| 6 | 336 | |
| 7 | 218 | 6.5% |
| 8 | 116 | 3.5% |
| 9 | 109 | 3.3% |
| Value | Count | Frequency (%) |
| 20 | 1 | < 0.1% |
| 19 | 1 | < 0.1% |
| 18 | 3 | 0.1% |
| 17 | 1 | < 0.1% |
| 16 | 2 | 0.1% |
| 15 | 7 | 0.2% |
| 14 | 6 | 0.2% |
| 13 | 14 | |
| 12 | 15 | |
| 11 | 28 |
international_charge
Real number (ℝ≥0)
| Distinct | 162 |
|---|---|
| Distinct (%) | 4.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.764581458 |
| Minimum | 0 |
|---|---|
| Maximum | 5.4 |
| Zeros | 18 |
| Zeros (%) | 0.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1.54 |
| Q1 | 2.3 |
| median | 2.78 |
| Q3 | 3.27 |
| 95-th percentile | 3.97 |
| Maximum | 5.4 |
| Range | 5.4 |
| Interquartile range (IQR) | 0.97 |
Descriptive statistics
| Standard deviation | 0.7537726127 |
|---|---|
| Coefficient of variation (CV) | 0.2726534284 |
| Kurtosis | 0.6096104298 |
| Mean | 2.764581458 |
| Median Absolute Deviation (MAD) | 0.48 |
| Skewness | -0.2452865083 |
| Sum | 9214.35 |
| Variance | 0.5681731516 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2.7 | 62 | 1.9% |
| 3.05 | 59 | 1.8% |
| 2.65 | 56 | 1.7% |
| 2.94 | 56 | 1.7% |
| 2.73 | 53 | 1.6% |
| 2.86 | 53 | 1.6% |
| 2.75 | 53 | 1.6% |
| 2.97 | 52 | 1.6% |
| 3 | 52 | 1.6% |
| 2.62 | 51 | 1.5% |
| Other values (152) | 2786 |
| Value | Count | Frequency (%) |
| 0 | 18 | |
| 0.3 | 1 | < 0.1% |
| 0.35 | 1 | < 0.1% |
| 0.54 | 2 | 0.1% |
| 0.57 | 2 | 0.1% |
| 0.59 | 1 | < 0.1% |
| 0.65 | 1 | < 0.1% |
| 0.68 | 1 | < 0.1% |
| 0.7 | 1 | < 0.1% |
| 0.73 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 5.4 | 1 | < 0.1% |
| 5.1 | 1 | < 0.1% |
| 4.97 | 1 | < 0.1% |
| 4.94 | 1 | < 0.1% |
| 4.91 | 2 | |
| 4.86 | 3 | |
| 4.83 | 1 | < 0.1% |
| 4.81 | 2 | |
| 4.75 | 2 | |
| 4.73 | 3 |
| Distinct | 2227 |
|---|---|
| Distinct (%) | 66.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 59.44975398 |
| Minimum | 22.93 |
|---|---|
| Maximum | 96.15 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 22.93 |
|---|---|
| 5-th percentile | 42.338 |
| Q1 | 52.38 |
| median | 59.47 |
| Q3 | 66.48 |
| 95-th percentile | 76.516 |
| Maximum | 96.15 |
| Range | 73.22 |
| Interquartile range (IQR) | 14.1 |
Descriptive statistics
| Standard deviation | 10.50226075 |
|---|---|
| Coefficient of variation (CV) | 0.1766577664 |
| Kurtosis | 0.04789313179 |
| Mean | 59.44975398 |
| Median Absolute Deviation (MAD) | 7.03 |
| Skewness | -0.03479132737 |
| Sum | 198146.03 |
| Variance | 110.2974809 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 55.05 | 6 | 0.2% |
| 63.56 | 5 | 0.2% |
| 52.98 | 5 | 0.2% |
| 64.88 | 5 | 0.2% |
| 58.31 | 5 | 0.2% |
| 63.43 | 5 | 0.2% |
| 58.03 | 5 | 0.2% |
| 52.5 | 5 | 0.2% |
| 67.25 | 5 | 0.2% |
| 65.25 | 5 | 0.2% |
| Other values (2217) | 3282 |
| Value | Count | Frequency (%) |
| 22.93 | 1 | |
| 23.25 | 1 | |
| 25.52 | 1 | |
| 25.87 | 1 | |
| 27.02 | 1 | |
| 27.08 | 1 | |
| 27.54 | 1 | |
| 27.77 | 1 | |
| 28.73 | 1 | |
| 30.04 | 1 |
| Value | Count | Frequency (%) |
| 96.15 | 1 | |
| 92.29 | 1 | |
| 92.2 | 1 | |
| 90.46 | 1 | |
| 90.12 | 1 | |
| 89.76 | 1 | |
| 89.31 | 1 | |
| 88.97 | 1 | |
| 88.66 | 1 | |
| 88.39 | 1 |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 26.2 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3333 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2850 | |
| 1 | 483 | 14.5% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 2850 | |
| 1 | 483 | 14.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2850 | |
| 1 | 483 | 14.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3333 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2850 | |
| 1 | 483 | 14.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3333 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2850 | |
| 1 | 483 | 14.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3333 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2850 | |
| 1 | 483 | 14.5% |
Auto
The auto setting is an easily interpretable pairwise column metric of the following mapping: vartype-vartype : method, categorical-categorical : Cramer's V, numerical-categorical : Cramer's V (using a discretized numerical column), numerical-numerical : Spearman's ρ. This configuration uses the best suitable for each pair of columns.Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| account_length | voice_mail_messages | customer_service_calls | international_plan | day_calls | day_charge | evening_calls | evening_charge | night_calls | night_charge | international_calls | international_charge | total_charge | churn | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 128 | 25 | 1 | 0 | 110 | 45.07 | 99 | 16.78 | 91 | 11.01 | 3 | 2.70 | 75.56 | 0 |
| 1 | 107 | 26 | 1 | 0 | 123 | 27.47 | 103 | 16.62 | 103 | 11.45 | 3 | 3.70 | 59.24 | 0 |
| 2 | 137 | 0 | 0 | 0 | 114 | 41.38 | 110 | 10.30 | 104 | 7.32 | 5 | 3.29 | 62.29 | 0 |
| 3 | 84 | 0 | 2 | 1 | 71 | 50.90 | 88 | 5.26 | 89 | 8.86 | 7 | 1.78 | 66.80 | 0 |
| 4 | 75 | 0 | 3 | 1 | 113 | 28.34 | 122 | 12.61 | 121 | 8.41 | 3 | 2.73 | 52.09 | 0 |
| 5 | 118 | 0 | 0 | 1 | 98 | 37.98 | 101 | 18.75 | 118 | 9.18 | 6 | 1.70 | 67.61 | 0 |
| 6 | 121 | 24 | 3 | 0 | 88 | 37.09 | 108 | 29.62 | 118 | 9.57 | 7 | 2.03 | 78.31 | 0 |
| 7 | 147 | 0 | 0 | 1 | 79 | 26.69 | 94 | 8.76 | 96 | 9.53 | 6 | 1.92 | 46.90 | 0 |
| 8 | 117 | 0 | 1 | 0 | 97 | 31.37 | 80 | 29.89 | 90 | 9.71 | 4 | 2.35 | 73.32 | 0 |
| 9 | 141 | 37 | 0 | 1 | 84 | 43.96 | 111 | 18.87 | 97 | 14.69 | 5 | 3.02 | 80.54 | 0 |
Last rows
| account_length | voice_mail_messages | customer_service_calls | international_plan | day_calls | day_charge | evening_calls | evening_charge | night_calls | night_charge | international_calls | international_charge | total_charge | churn | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3323 | 117 | 0 | 5 | 0 | 126 | 20.13 | 97 | 21.19 | 56 | 10.22 | 3 | 3.67 | 55.21 | 1 |
| 3324 | 159 | 0 | 1 | 0 | 114 | 28.87 | 105 | 16.80 | 82 | 8.72 | 4 | 3.13 | 57.52 | 0 |
| 3325 | 78 | 0 | 2 | 0 | 99 | 32.88 | 88 | 9.94 | 109 | 10.95 | 4 | 2.51 | 56.28 | 0 |
| 3326 | 96 | 0 | 1 | 0 | 128 | 18.12 | 87 | 24.21 | 92 | 8.05 | 7 | 4.02 | 54.40 | 0 |
| 3327 | 79 | 0 | 2 | 0 | 98 | 22.90 | 68 | 16.12 | 128 | 9.96 | 5 | 3.19 | 52.17 | 0 |
| 3328 | 192 | 36 | 2 | 0 | 77 | 26.55 | 126 | 18.32 | 83 | 12.56 | 6 | 2.67 | 60.10 | 0 |
| 3329 | 68 | 0 | 3 | 0 | 57 | 39.29 | 55 | 13.04 | 123 | 8.61 | 4 | 2.59 | 63.53 | 0 |
| 3330 | 28 | 0 | 2 | 0 | 109 | 30.74 | 58 | 24.55 | 91 | 8.64 | 6 | 3.81 | 67.74 | 0 |
| 3331 | 184 | 0 | 2 | 1 | 105 | 36.35 | 84 | 13.57 | 137 | 6.26 | 10 | 1.35 | 57.53 | 0 |
| 3332 | 74 | 25 | 0 | 0 | 113 | 39.85 | 82 | 22.60 | 77 | 10.86 | 4 | 3.70 | 77.01 | 0 |